Identifying diagnostic errors with induced decision trees.
نویسنده
چکیده
OBJECTIVE The purpose of this article is to compare the diagnostic accuracy of induced decision trees with that of pruned neural networks and to improve the accuracy and interpretation of breast cancer diagnosis from readings of thin-needle aspirate by identifying cases likely to be misclassified by induced decision rules. METHOD Using an online database consisting of 699 cases of suspected breast cancer and their corresponding readings of fine-needle aspirate, decision trees were induced from half of the cases, randomly selected. Accuracy was determined for the remaining cases in successive partitions. The pattern of errors in the multiple decision trees was examined. A smaller data set was created with 2 classes: (1) correctly classified and (2) misclassified by a decision tree, rather than the original benign and malignant classes. From this data set, decision trees that describe the misclassified cases were induced. RESULTS Larger, less severely pruned decision trees were more accurate in breast cancer diagnosis for both training and test data. The accuracy of the induced decision trees exceeded that reported for the smaller pruned neural networks. Combining classifications from 2 trees was effective in identifying malignancies missed by a single tree. Induced decision trees were able to identify patterns associated with misclassified cases, but the identification of errors inductively did not improve the overall error rate. CONCLUSION In this application, a model that is too compact identifies fewer cases of the minority class, malignancy. New methods that combine models and examine classification errors can improve diagnosis by identifying more malignancies and by describing ambiguous cases.
منابع مشابه
مطالعات درخت تصمیم در برآورد ریسک ابتلا به سرطان سینه با استفاده از چند شکلیهای تک نوکلوئیدی
Abstract Introduction: Decision tree is the data mining tools to collect, accurate prediction and sift information from massive amounts of data that are used widely in the field of computational biology and bioinformatics. In bioinformatics can be predict on diseases, including breast cancer. The use of genomic data including single nucleotide polymorphisms is a very important ...
متن کاملInduced Decision Trees for Temporal Medical Data
Patient diagnosis and treatment involves factors that occur at different times. In order to identify as early as possible those factors associated with excessive length of stay in a hospital, time staging was added to induced decision trees. Time-staged induced decision trees uncovered useful patterns in data from the earliest time period, which represents a patient’s medical history, even thou...
متن کاملA New Algorithm for Optimization of Fuzzy Decision Tree in Data Mining
Decision-tree algorithms provide one of the most popular methodologies for symbolic knowledge acquisition. The resulting knowledge, a symbolic decision tree along with a simple inference mechanism, has been praised for comprehensibility. The most comprehensible decision trees have been designed for perfect symbolic data. Classical crisp decision trees (DT) are widely applied to classification t...
متن کاملFeature Selection for Tool Wear Diagnosis Using Soft Computing Techniques
This paper examines feature selection methods in the context of milling machine tool wear diagnosis. Given raw sensor signals acquired during experiments, a pool of features was created through calculation by several feature extraction methods. Five techniques for selecting the most discriminating features were employed. These techniques included decision trees, neuralfuzzy methods, scatter mat...
متن کاملI-22: Decision Trees for Identifying Predictor of Treatment Effectiveness in Clinical Trials and Its Application to Ovulation in a Study of Women with Polycystic Ovary Syndrome
Background: Double-blind, randomized clinical trials are the preferred approach to demonstrate the effectiveness of one treatment against another. The comparison is, however, made on the average group effects. While patients and clinicians have always struggled to understand why patients respond differently to the same treatment, and while much hope has been held out for the nascent field of pr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Medical decision making : an international journal of the Society for Medical Decision Making
دوره 21 5 شماره
صفحات -
تاریخ انتشار 2001